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ABSTRACT 

RNA-dependent RNA polymerase RDR6 is involved 
in the biogenesis of plant frans-acting siRNAs. 
This process Is initiated by miRNA-directed and 
Argonaute (AGO) protein-mediated cleavage of 
TAS gene transcripts. One of the cleavage products 
is converted by RDR6 to double-stranded (ds)RNA, 
the substrate for DIcer-IIke 4 (DGL4). Interestingly, 
TAS3 transcript contains two target sites for 
miR390::AGO7 complexes, 5'-non-cleavable and 
3'-cleavable. Here we show that RDR6-mediated syn- 
thesis of complementary RNA starts at a third nucleo- 
tide of the cleaved TAS3 transcript and is terminated 
by the miR390::AGO7 complex stably bound to the 
non-cleavable site. Thus, the resulting dsRNA has a 
short, 2-nt, 3'-overhang and a long, 220-nt, 
5'-overhang of the template strand. The short, but 
not long, overhang is optimal for DCL4 binding, 
which ensures dsRNA processing from one end into 
phased siRNA duplexes with 2-nt 3'-overhangs. 

INTRODUCTION 

The biogenesis of plant /ra«.y-acting short interfering (si) 
RNAs (tasiRNAs) is initiated by microRNA (miRNA)- 
directed and Argonaute (AGO) protein-mediated 
cleavage of a primary RNA (pri-RNA) transcribed by 
Pol II from a TAS gene. One of the pri-RNA cleavage 
products is fully or partially converted by RNA- 
dependent RNA polymerase (RDR) 6 (RDR6) to 
double-stranded RNA (dsRNA). The dsRNA is processed 
by Dicer-hke 4 (DCL4) from the miRNA cleavage 
terminus into several consecutive 21-nt siRNA duplexes 
with 2-nt 3'-overhangs (1,2). One of the duplex strands 
(mature tasiRNA) binds an AGO protein and guides the 
resulting RNA-induced silencing complex (RISC) to com- 
plementary target mRNAs. The RISC complex inter- 
action with target mRNA leads to its silencing through 
AGO-mediated cleavage or translational repression. 

In Arabidopsis thaliana, three miRNAs are known to 
initiate tasiRNA biogenesis: miR173, miR390 and 



miR828 (1,3). The miR173-dependent tasiRNAs are 
encoded in TASla/b/c and TAS2 genes and target 
several genes of the pentatricopeptide repeat gene family 
and those of unknown function. The miR828-dependent 
tasiRNA is encoded in TAS4 and targets three 
members of MYB (myeloblastosis) transcription factor 
gene family. The miR390-dependent tasiRNAs, also 
called tasiARFs, are encoded in three TAS3 family 
genes (3a, 3b and 3c) and target the genes from auxin 
response factor {ARF) family including ARF3/ETTIN 
and ARF4 that regulate plant development. Only TAS3- 
derived tasiARF species are conserved in higher plants. 
Here we used TAS3a, the most highly expressed TAS3 
family gene (4), to address some of the unknowns of 
tasiRNA biogenesis and, specifically, to determine the 
structures of RDR6-dependent dsRNA precursors of 
tasiARFs. 

To guide cleavage of TAS3a/b/c pri-RNAs and thereby 
initiate tasiARF biogenesis, niiR390 binds AG07 to 
form a miR390-AGO7 complex with sheer activity (5). 
Importantly, all the three TAS3 pri-RNAs carry a 
second, non-cleavable, miR390 complementary site, 
190-230 nt upstream of the miR390 cleavage site (4) 
(Figure lA). The non-cleavable interaction of the 
miR390-AGO7 complex with TAS3a pri-RNA is abso- 
lutely necessary for RDR6-dependent tasiARF produc- 
tion: the non-cleavable site sequence could neither 
tolerate mutations that destabilize the complementary 
interaction, nor be functionally substituted by unrelated 
target sequences fully or partially complementary to a 
heterologous miRNA, e.g. AGO 1 -associated niiR171a 
(5,6). In contrast, the miR390 cleavage site sequence 
could be functionally replaced with the miR171 cleavage 
site sequence (5). The specific function of the miR390- 
AG07 complex at the non-cleavable site is not known. 
It has been proposed that a stable interaction of this 
complex with cleaved TAS3 pri-RNAs passively or actively 
recruits RDR6 to the cleaved pri-RNA 3'-terminus, where 
complementary RNA synthesis is to be initiated (5). The 
fact that the region of all the three TAS3 genes between 
the two target sites spawns the majority of 
RDR6-dependent siRNAs including tasiARFs (4) led us 
to propose that RDR6 synthesis of complementary RNA 
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Figure 1. Model for the biogenesis of a dsRNA precursor of TAS3a tasiRNA and cRT-PCR sequencing of tlie dsRNA termini. (A) Diagrammatic 
representation of TAS3 dsRNA biogenesis. TAS3a gene is transcribed by Pol II into a capped and polyadenylated pri-RNA, followed by splicing. 
The spliced pri-RNA is associated with two miR390-AGO7 complexes at the respective upstream (non-cleavable) and downstream (cleavable) target 
sites. The downstream complex cleaves pri-RNA and dissociates from the 5' -cleavage product, the template for RDR6. RDR6 is recruited to the 
3'-end of the template and initiates complementary RNA synthesis. The upstream miR390-AGO7 complex stably bound to the template terminates 
RDR6 synthesis. The sequences surrounding the TASSa transcription start and miR390 target sites are shown in the expanded regions. The major 
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is terminated by the miR390-AGO7 complex stably 
bound to the template RNA. Here, we tested this hypoth- 
esis by sequencing of RDR6-synthesized complementary 
RNA derived from TAS3a gene. 



MATERIALS AND METHODS 

Seedhngs of Arabidopsis wild-type (Col-0 and La-er) and 
mutant lines dcl2/3/4 (7), zip-2 [ago7; (8,9)] and rdr6-15 (1) 
were grown and infected with Cauliflower mosaic virus 
(CaMV) as described previously (7). The Arabidopsis 
transactivator/viroplasmin (TAV) transgenic line D4-2 
was generated and characterized earlier (10,11). Our pre- 
viously described protocols were used for total RNA prep- 
aration (12), high resolution blot hybridization (11) and 
circularization-reverse transcription-PCR (cRT-PCR) 
[(13,14); also see 'Protocol' in Supplementary Data]. The 
hybridization probes and cRT-PCR primers are hsted in 
Supplementary Table SI. 



RESULTS AND DISCUSSION 

Under normal conditions, dsRNA precursors of 
tasiRNAs are rapidly processed by DCL4 and therefore, 
accumulate to the levels barely detectable by RNA blot 
hybridization. To characterize the structures of TAS3a- 
derived dsRNA(s) we made use of Arabidopsis Col-0 and 
La-er plants infected with CaMV or transgenic Col-0 
plants expressing a CaMV TAV protein. Both CaMV in- 
fection and TAV expression stabilize dsRNAs derived 
from TASla/b/c, TAS2 and TAS3a genes (11). In the 
case of TAS3a, transgene expression of TAV derived 
from CaMV strain D4 (TAV-D4) had the strongest sta- 
bilization effect (11). Here, we performed a higher reso- 
lution blot hybridization analysis of TAS3a-denYed long 
sense and antisense RNAs using DNA oligonucleotide 
probes specific for TAS3a siRNAs 5'D7(+) and 5'D7(-), 
i.e. sense and antisense strands of the 5'D7 duplex 
located between the two miR390 target sites. In sense 
polarity, a very abundant RNA of ~460nt and several 
shorter, less abundant RNAs ranging from 140 nt to 
330 nt accumulated in TAV-D4 plants. These RNAs 
accumulated at much lower levels in CaMV-infected 
Col-0 plants (Figure IC). The estimated size of the most 
abundant sense RNA corresponds to that of the 
5'-cleavage product of TAS3a pri-RNA mapped 



previously by 5'- and 3'-RACE (4) (rapid amplification 
of complementary DNA ends) (Figure lA). 

In antisense polarity, a very abundant RNA of ~230nt 
and less abundant, shorter (~200 nt) and longer (~250 nt) 
RNAs accumulated in TAV-D4 plants (Figure IC). 
The 230-nt and 200-nt RNAs were also detected in 
Col-0 and La-er plants, albeit at much lower levels. 
Genetic evidence revealed that both of these antisense 
RNAs are RDR6-dependent, unlike the 460-nt sense 
RNA, they did not accumulate in rdr6-15 null mutant 
plants (Figure IC). The 230-nt (but not 200-nt) antisense 
RNA accumulated in triple null mutant plants lacking 
DCL2, DCL3 and DCL4 activities [dcl2/3/4 in Col-0 
background; (7)] (Supplementary Figure SI A). This indi- 
cates that the 230-nt RNA is a complete RDR6 transcript 
and hence the complementary strand of TAS3a-denved 
major dsRNA. The 200-nt RNA may represent a comple- 
mentary strand of a shorter dsRNA produced via a sec- 
ondary, in cis cleavage of TAS3a pri-RNA by TAS3a 
siRNA 5'D2(— ) which was reported previously (1). Both 
antisense RNAs did not accumulate in AG07 null mutant 
plants (Supplementary Figure SIA), thus confirming an 
essential role of AG07 in the biogenesis of dsRNA pre- 
cursors of tasiARFs. Taken together, our results suggest 
that the major dsRNA derived from TAS3a gene is an 
asymmetrical duplex composed of ~450-nt sense RNA 
(miR390-cleavage product of TAS3a pri-RNA) and 
~230-nt antisense RNA (RDR6 transcript). 

To precisely map the RDR6 transcript, we used a cRT- 
PCR method that enables simultaneous cloning and 
sequencing of both 5'- and 3'-terminal regions of a 
given RNA [(13); schematically shown in Supplementary 
Figure SIB]. Since RDR6 activity does not require a 
primer (15), the 5'-terminal nucleotide of an RDR6 
transcript is likely 5'-triphosphorylated. To circularize 
such a transcript, prior to RNA hgation, total plant 
RNA was treated with pyrophosphatase which con- 
verts a 5'-polyphosphorylated or capped nucleotide to 
a 5'-nionophosphorylated nucleotide. The cRT-PCR 
primers proximal to the termini of r^5ia-derived major 
antisense RNA were designed based on the size estimated 
by blot hybridization (Figure IC). The major cRT-PCR 
products of expected size obtained from TAV-D4 trans- 
genic and CaMV-infected dcl2l3l4 plant samples were 
gel-purified, cloned and individual clones were sequenced. 

For both samples, cRT-PCR analysis (Supplementary 
Figure SIC; summarized in Figure IB) revealed that 



Figure 1. Continued 

and minor transcription start sites mapped previously [by Howell et al. (4)] are indicated by the big and small broken arrows, respectively. 
The numbering is from the 5'-end of the longest cDNA representing TAS3a pri-RNA. The miR390 basepairing to the non-cleavable and cleavable 
sites are shown and position of the expected cleavage is indicated with open or filled arrows, respectively. (B) cRT-PCR mapping of the TAS3a sense 
and antisense RNAs (Supplementary Figure SIC for details). Sequences of TAS3a gene surrounding the transcription start and two miR390 target 
sites are shown in sense and antisense orientation. The numbering is as in (A). The termini of sequenced cRT-PCR clones are indicated by arrows 
above the sequence for sense RNAs (a total of 45 clones) and below the sequence for antisense RNAs (a total of 44 clones). The number of clones is 
given when more than one clone had the same 5'- or 3'-terminus. Additional clones that map outside of the shown sequences are indicated by grey 
arrows with a total number of the clones given in brackets above the sense strand and below the antisense strand. (C) RNA blot hybridization 
analysis of TASSa sense and antisense RNAs accumulating in CaMV-infected (+) or mock inoculated (— ) wild-type (Col-0) and rdr6 mutant 
(rdr6-15) plants or plants transgenic for CaMV TAV protein (TAV-D4). The polyacrylamide gel (PAAG) was composed of 5% upper part and 
15% lower part as indicated. Two blots shown on the left and on the right were hybridized successively with probes specific to sense and antisense 
strands of the TASSa 5'D7 siRNA duplex. The 21-nt guide stand of this duplex accumulates in wild-type but not rclr6 mutant plants (indicated by 
arrow). Positions of single-stranded RNA markers ranging from 100 nt to 1000 nt are indicated. The 5'-cleavage product of TASSa pri-RNA and the 
major antisense RNA (RDR6 product) are indicated by blue and magenta arrows, respectively. rRNA stained with EtBr is shown as loading control. 
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the major antisense RNA begins predominantly witii G469 
[the numbering is from a 5'-terminal nucleotide 
of the longest cDNA representing a spliced TAS3a 
pri-RNA (4); Figure lA]: 34 of 44 clones mapped to this 
position, while only 6 clones mapped to A468. Hence, 
RDR6 initiates complementary RNA synthesis predomin- 
antly at the third nucleotide from the 3'-terminus of the 
niiR390-AGO7-cleaved TAS3a pri-RNA. This creates a 
2-nt 3'-overhang on the resulting dsRNA. A similar 
property was reported for an RDR activity isolated from 
tomato plants: in vitro, it initiated synthesis of complemen- 
tary RNA at either the second or the third nucleotide from 
artificial RNA template's 3'-hydroxylated terminus (16). 

The 3'-termini of cloned antisense RNAs were more 
heterogeneous, but 29 of 44 clones clustered in a close 
vicinity of the miR390 non-cleavable target site. This 
location fits the estimated size of the major antisense 
RNA detected by blot hybridization. Two major fractions 
of the antisense RNA terminate with A237 (12 clones) and 
A238 (7 clones), while two minor fractions terminate with 
A236 (2 clones) and U240 (4 clones) (Figure IB). Thus, 
the predominant 3'-terminal nucleotide of the RDR6 tran- 
script (A237) is complementary to the last nucleotide of 
the miR390 non-cleavable target sequence, which can po- 
tentially pair with the 5'-terminal nucleotide of niiR390 
(Figure lA). Based on crystal structures of Thermus 
thermophilus AGO-guide strand complexes (17), it can 
be envisaged that the 5'-phosphate of miR390 is seques- 
tered in the binding pocket of AG07 protein and there- 
fore, the 5'-terminal nucleotide cannot pair with the 
complementary nucleotide of the target site. Hence, this 
complementary nucleotide can potentially be copied by 
RDR6 without displacement of the miR390/template 
RNA complex. Only a minor fraction of the RDR6 tran- 
script terminates at A236, pairing with the second nucleo- 
tide of miR390, whereas, the second largest fraction 
terminates just in front of the target site (A238). We 
conclude that RDR6-mediated synthesis of complemen- 
tary RNA is effectively terminated by the miR390- 
AG07 complex stably bound to the target site 
(Figure lA). RDR6 appears to be unable to displace this 
complex and proceed with synthesis of complementary 
RNA further upstream, towards the template's 5'-end. 

We next used cRT-PCR with reverse primers in order 
to precisely map TAS3a sense RNAs. The design of cRT- 
PCR with the RT primer located about 40 nt downstream 
of the miR390 non-cleavable site (Supplementary Figure 
SIC) allowed us not only to pick up the previously 
mapped, 5'-cleavage product of pri-RNA (the most 
abundant RNA of ~450 nt, stabilized by CaMV TAV) 
but also to analyze less abundant, shorter RNAs in the 
size range starting from ~200 nt. Note that this design was 
biased towards the shorter RNAs, despite their lower 
abundance. Confirming the previous 5'-RACE mapping 
results (4), our analysis (Supplementary Figure SIC; 
summarized in Figure IB) revealed that the 5'-cleavage 
product of TAS3a pri-RNA, i.e. the presumptive 
template for RDR6, starts predominantly at A16, i.e. at 
the major transcription start site located 16 nt downstream 
of the most distal, minor transcription start site (Figure 
lA). Both, the 5'-cleavage product and shorter TAS3a 



sense RNAs terminate predominantly at C471 (34 of 45 
clones), which corresponds to the miR390-directed 
cleavage between nucleotides 10 and 11 from the 
miR390 5'-end (Figure lA). A small fraction of sense 
RNA terminated at either T468 (1 clone), or C469 
(4 clones), or C470 (6 clones) (Figure IB and 
Supplementary Figure SIC). The RNA terminating with 
C470 could be a template for the minor fraction of RDR6 
transcript starting with A468 which is complementary to 
the third nucleotide from this template's end. 

Analysis of the TAS3a-dQn\ed shorter sense RNAs 
revealed that a large fraction of their 5'-termini fall hetero- 
geneously in two clusters just downstream of the miR390 
non-cleavable target site and a smaller fraction of the 
5'-termini fall within or upstream of the target site 
sequence (Figure IB and Supplementary Figure SIC). 
These results are consistent with the previous 5'-RACE 
mapping (4) showing no evidence for a precise, 
miR390-directed cleavage at this target site. Together 
with blot hybridization, cRT-PCR data also indicate 
that a fraction of the 5'-cleavage product of TAS3a 
pri-RNA is gradually truncated from the 5'-end before 
or after RDR6-mediated production of the asymmetrical 
dsRNA. A regular ladder of the truncated antisense 
RNAs (Figure IC) points at involvement of a non- 
randomly terminating 5'-3'-exonuclease. 

Taken together, our genetic analysis, RNA blot hybrid- 
ization and cRT-PCR data suggest the following model 
for the biogenesis of dsRNA precursor of TAS3a 
tasiRNAs (Figure lA): Two AGO7-miR390 complexes 
bind TAS3a pri-RNA at the upstream (non-cleavable) 
and downstream (cleavable) target sites and the down- 
stream complex cleaves the target sequence between nu- 
cleotides 10 and 11 from the miR390 5'-end, followed by 
dissociation of this complex. The 5'-cleavage product of 
pri-RNA associated with the upstream complex serves as a 
template for RDR6. RDR6-mediated synthesis of comple- 
mentary RNA is initiated at the third nucleotide from the 
template's 3'-end and terminated just in front of the 
miR390-AGO7 complex bound at the non-cleavable 
site. In most cases, the last nucleotide incorporated 
into the RDR6 transcript is complementary either to the 
template nucleotide located just downstream to the target 
sequence or to the last nucleotide of the target sequence. 
Only a minor fraction of RDR6 transcript is terminated 
one nucleotide further downstream, which would require 
partially melting the miR390/template RNA duplex. 
Thus, RDR6 appears to be unable to displace the 
miR390-AGO7 complex from the non-cleavable target 
site. The resulting dsRNA precursor of tasiARFs has an 
asymmetrical structure with two overhangs of the sense 
(template) strand: the long, 220-nt 5'-overhang and the 
short, 2-nt 3'-overhang (Figure lA). 

Interestingly, RNA duplexes with long 5'-overhangs 
are preferential in vitro binding substrates for SGS3 (18), 
the protein essential for tasiRNA biogenesis. The role 
of SGS3 in tasiRNA biogenesis is not known yet, 
although genetic evidence suggested that it stabilizes 
miRNA-cleaved pri-RNA (2). 

Only the 2-nt 3'-overhang but not the long 5'-overhang 
is compatible with DCL4 binding and in-phase 
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processing of the asymmetric TAS3a dsRNA. Indeed, 
tasiARFs are encoded in the phase register set by 
miR390-directed cleavage. A 2-nt 3'-overhang is optimal 
for binding and processing miRNA precursors by recom- 
binant hmnan Dicer in vitro (19). Furthermore, the 2-nt 
3'-overhang fits into the PAZ (Piwi/Argonaute/Zwille) 
domain binding pocket of Giardia Dicer (20). Four 
Arabidopsis DCLs including DCL4 possess a conserved 
PAZ domain (21,22) that might be involved in recognition 
of dsRNA substrates with 2-nt 3'-overhangs. Our parallel 
study of dsRNA precursors of TASla/b/c- and TAS2- 
derived tasiRNAs revealed that these dsRNAs have 1- 
or 2-nt 3'-overhangs which make them compatible with 
DCL4 processing from both ends (R. Rajeswaran et al., 
submitted for publication). In the case of TAS3a, and pre- 
sumably TAS3b and TAS3c, both the long 5'-overhang of 
dsRNA precursor and the miR390-AGO7 complex that 
may remain associated with this overhang would prevent 
DCL4 binding, thus ensuring the correct processing of 
phased tasiRNAs only from the opposite terminus. 

Stable association of miR390 with its non-cleavable 
target site on TAS3 pri-RNA is further supported by a 
'target mimicry' phenomenon, in which miR399 is effect- 
ively sequestered from cleavage-mediated silencing of its 
target niRNA by stable interaction with a non-cleavable, 
decoy target RNA (23). Notably, in both cases the pairing 
of a miRNA and a target RNA is interrupted by a mis- 
matched loop at the expected miRNA cleavage site. 

We demonstrate here that a non-cleavable interaction of 
miR390 and TAS3a pri-RNA terminates RDR6 synthesis 
and thereby, protects the sequences upstream of the 
miR390 binding site from being converted to dsRNA. 
This precludes production of additional siRNAs from 
the upstream region, which may potentially cause negative 
effects. The phenomenon of transitivity or spread of 
secondary siRNA production along miRNA/siRNA 
target genes has been described in plants (24) and 
Caenorhabditis elegans (25). In multigene families contain- 
ing a miRNA target gene the transitivity can potentially 
lead to production of secondary siRNAs from the 
conserved region(s) with high nucleotide sequence similar- 
ity and therefore result in off-target cleavage and silencing 
of all the family members. In this case, a second, 
non-cleavable target site for miRNA/siRNA, which 
is located upstream of the cleavage site, would 
confine RDR-dependent siRNA production just to the 
short intervening region. This represents an additional 
function of the two-hit trigger mechanism of tasiRNA 
biogenesis proposed for TAS3 family genes and other 
genes spawning RDR6-dependent siRNAs (4,6). 

SUPPLEMENTARY DATA 

Supplementary Data are available at NAR Online. 
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